Perception Score: A Learned Metric for Open-ended Text Generation Evaluation
نویسندگان
چکیده
Automatic evaluation for open-ended natural language generation tasks remains a challenge. We propose learned metric: Perception Score. It utilizes pre-trained model and considers context information conditional generation. Score assigns holistic score along with the uncertainty measurement. conduct experiments on three two unconditional tasks. achieves state-of-the-art results all consistently in terms of correlation human scores.
منابع مشابه
Affect Detection from Open-Ended Improvisational Text
We report progress on adding affect-detection to a program for virtual dramatic improvisation, monitored by a human director. We have developed an affect-detection module to control an automated virtual actor and to contribute to the automation of directorial functions. The work also involves basic research into how affect is conveyed through metaphor. The relevance of the project to the sympos...
متن کاملMachine Translation Evaluation Metric for Text Alignment
As plagiarisers become cleverer, plagiarism detection becomes harder. Plagiarisers will find new ways to obfuscate the plagiarized passages so that humans and automatic plagiarism detectors are not able to point them out. So, a plagiarism detection system needs to be robust enough to detect plagiarism, no matter what obfuscation techniques have been applied. Our system attempts to do the same b...
متن کاملClosing in on open–ended patient questionnaires with text mining
Knee injury and Osteoarthritis Outcome Score (KOOS) is an instrument used to quantify patients' perceptions about their knee condition and associated problems. It is administered as a 42-item closed-ended questionnaire in which patients are asked to self-assess five outcomes: pain, other symptoms, activities of daily living, sport and recreation activities, and quality of life. We developed KLO...
متن کاملExploitation In Affect Detection In Open-Ended Improvisational Text
We report progress on adding affectdetection to a program for virtual dramatic improvisation, monitored by a human director. We have developed an affect-detection module to control an automated virtual actor and to contribute to the automation of directorial functions. The work also involves basic research into how affect is conveyed through metaphor. The project contributes to the application ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v35i14.17526